Frequency-based Rare Events Mining in Administrative Health Data
نویسندگان
چکیده
The low occurrence rate of adverse drug reactions makes it difficult to identify risk factors from a straightforward application of association pattern discovery in large databases. In this paper, we are interested in developing a data mining approach that can use the information about rare events in sequence data in order to measure the multiple occurrences of patterns in the whole period of target and non-target data. To address this, we define an interestingness measure which exploits the difference between the frequency of patterns in target and non-target sequence data. The proposed approach guarantees the easy generation of candidate patterns from the target sequence data by applying existing association mining algorithms. These patterns can then be evaluated by comparing their frequency in the target and non-target data. We also propose a ranking algorithm that takes into account both the rank of the patterns as determined by the interestingness measure and their supports in the target population. This algorithm can prune the patterns greatly and highlight more interesting results. Experimental results of a case study on Angioedema show the usefulness of the proposed approach.
منابع مشابه
Frequency-Based Temporal Pattern Mining in Health Data
The low occurrence rate of adverse drug reactions makes it difficult to identify the risk factors from straightforward application of frequent pattern discovery in large databases. In this paper, we are interested in developing a data mining strategy that can fully utilize the information around rare events in sequence data in order to measure the multiple occurrences of patterns in the whole p...
متن کاملRare Event Analysis of High Dimensional Building Operational Data Using Data Mining Techniques
Today’s building automation systems (BASs) are becoming increasingly complex. A typical BAS usually stores hundreds of sensor measurements and control signals at each time step, which produces massive high dimensional data sets. Traditional analysis methods for BAS data only focus on a small subset of the data, resulting in a huge information loss. Data mining techniques are more effective in k...
متن کاملA Framework of Process Mining for RFID Event Analysis
As information systems and telecommunication devices are spread, many organizations accumulate a lot of events which are generated in performing business activities. The analysis of real-time data and events can play a critical role in implementing real-time enterprises and business intelligence. Recently, supply chain and manufacturing sectors have adopted ubiquitous environment that generate ...
متن کاملIdentifying fall-related injuries: Text mining the electronic medical record
Unintentional injury due to falls is a serious and expensive health problem among the elderly. This is especially true in the Veterans Health Administration (VHA) ambulatory care setting, where nearly 40% of the male patients are 65 or older and at risk for falls. Health service researchers and clinicians can utilize VHA administrative data to identify and explore the frequency and nature of fa...
متن کاملDetection of adverse drug events: proposal of a data model.
Our main objective is to detect adverse drug events (ADEs) in former hospital stays. As ADEs are rare, that supposes to screen thousands of electronic health records (EHRs). For that purpose, we need to define a data model that has two main objectives: (1) being able to describe hospital stays from various hospitals (2) being tuned so as to prepare the data mining process: as ADEs are not flagg...
متن کامل